Modeling Visual Compatibility through Hierarchical Mid-level Elements
نویسندگان
چکیده
In this paper we present a hierarchical method to discover mid-level elements with the objective of modeling visual compatibility between objects. At the base-level, our method identifies patterns of CNN activations with the aim of modeling different variations/styles in which objects of the classes of interest may occur. At the top-level, the proposed method discovers patterns of co-occurring activations of baselevel elements that define visual compatibility between pairs of object classes. Experiments on the massive Amazon dataset show the strength of our method at describing object classes and the characteristics that drive the compatibility between them.
منابع مشابه
Hierarchical Implicit Shape Modeling
In this paper, a new hierarchical approach for part-based object recognition is proposed. Object detection methods based on Implicit Shape Model (ISM) efficiently handle deformable objects, occlusions and clutters. The structure of each object in ISM is defined by a spring like graph, hence parts independently vote to object properties. We introduce hierarchical ISM in which structure of each o...
متن کاملModeling Mid-level Visual Representations through Clustering in a Convolutional Neural Network
The nature of visual properties used in cortical perception is subject to considerable ongoing study. Features of intermediate complexity are particularly uncertain. Convolutional Neural Network (CNN) models, however, have proven to be quite effective in modeling human vision (Yamins et al., 2014) and have performed with great accuracy on image classification tasks (Krizhevsky et al., 2012). St...
متن کاملLearning Discriminative Visual N-grams from Mid-level Image Features
Mid-level image features have been shown to be helpful to bridge the semantic gap between low-level and high-level image representations. Many existing methods to learn mid-level visual elements consider each mid-level feature individually, and do not take their mutual relationships into account. We follow the intuitive idea that learning discriminative combinations of visual elements can help ...
متن کاملVISDA: an open-source caBIGTM analytical tool for data clustering and beyond
SUMMARY VISDA (Visual Statistical Data Analyzer) is a caBIG analytical tool for cluster modeling, visualization and discovery that has met silver-level compatibility under the caBIG initiative. Being statistically principled and visually interfaced, VISDA exploits both hierarchical statistics modeling and human gift for pattern recognition to allow a progressive yet interactive discovery of hid...
متن کاملVideo (GIF) Sentiment Analysis using Large-Scale Mid-Level Ontology
With faster connection speed, Internet users are now making social network a huge reservoir of texts, images and video clips (GIF). Sentiment analysis for such online platform can be used to predict political elections, evaluates economic indicators and so on. However, GIF sentiment analysis is quite challenging, not only because it hinges on spatio-temporal visual contentabstraction, but also ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1604.00036 شماره
صفحات -
تاریخ انتشار 2016